DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

نویسندگان

چکیده

Abstract Semantic segmentation in cataract surgery has a wide range of applications contributing to surgical outcome enhancement and clinical risk reduction. However, the varying issues segmenting different relevant structures these surgeries make designation unique network quite challenging. This paper proposes semantic network, termed DeepPyramid, that can deal with challenges using three novelties: (1) Pyramid View Fusion module which provides varying-angle global view surrounding region centering at each pixel position input convolutional feature map; (2) Deformable Reception enables deformable receptive field adapt geometric transformations object interest; (3) dedicated Loss adaptively supervises multi-scale maps. Combined, we show modules effectively boost performance, especially case transparency, deformability, scalability, blunt edges objects. We demonstrate our approach performs state-of-the-art level outperforms number existing methods large margin (\(3.66\%\) overall improvement intersection over union compared best rival approach).KeywordsCataract surgerySemantic segmentationSurgical data science

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation

CNN architectures have terrific recognition performance but rely on spatial pooling which makes it difficult to adapt them to tasks that require dense, pixel-accurate labeling. This paper makes two contributions: (1) We demonstrate that while the apparent spatial resolution of convolutional feature maps is low, the high-dimensional feature representation contains significant sub-pixel localizat...

متن کامل

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

We introduce a fast and efficient convolutional neural network, ESPNet, for semantic segmentation of high resolution images under resource constraints. ESPNet is based on a new convolutional module, efficient spatial pyramid (ESP), which is efficient in terms of computation, memory, and power. ESPNet is 22 times faster (on a standard GPU) and 180 times smaller than the stateof-the-art semantic ...

متن کامل

Segmentation Pyramid Classification

Ben Gorte Dept. of Geoinformatics ITC the Netherlands [email protected] Commission III, Working Group 3

متن کامل

Pyramid segmentation algorithms revisited

The main goal of this work is to compare pyramidal structures proposed to solve segmentation tasks. Segmentation algorithms based on regular and irregular pyramids are described, together with the data structures and decimation procedures which encode and manage the information in the pyramid. In order to compare the different segmentation algorithms, we have employed three types of quality mea...

متن کامل

Volumetric Semantic Segmentation using Pyramid Context Features Supplemental Material

Our “pyramid filtering” insight enables exact and extremely efficient per-voxel classification with minimal memory overhead. We will now demonstrate our improvement over existing techniques empirically and theoretically. For an empirical demonstration of efficiency, consider the following two alternatives to pyramid filtering: 1) the “sliding window” approach: iterate through every voxel in the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-16443-9_27